
KM G^APHC iflC- presents the sourcerer 



A COMPREHENSIVE DISASSEMBLER FOR ADflO OBJECT CODE: The Sourcerer is designed 
to convert flOaO object code files to source code files-, which are easier to 
understand and modify- The code is systematically converted to Intel stan- 
dard mnemonics! and the created file is compatible with most floao assemblers-, 
including ESP-1 and others based on the Processor Technology assembler. 

CREATE ESP FILES DIRECTLY IN RAM: Files may be formed directly in memory-, 
or may be listed on any output terminal device- Several formats are avail- 
able-, including two with decimal line numbers and formatted constants and 
symbols- All symbols have a leading H followed by their hex equivalents i 
thus the assembler treats them as labels- All numerical constants have a 
leading zero and trailing H so they are treated as hexadecimal constants- 

SYMBOL TABLE: The Sourcerer operates in two passes. On the first pass-, 
all three-byte codes are used to form a hex symbol table- This table is 
used on the second pass to affix labels to the file- The symbol table 
may be listed separately. 

CROSS-REFERENCE TABLE: A cross-reference table may be listed- This table 
consists of each symbol-, the RAM addressed from where it is referenced-, and 
the mnemonic codes that reference it- 

The Sourcerer is a unique tool-, a must for the true software buff- It 
allows youi in a very short timei to create source files from all those 
machine code programs that you want to modify-, but are reluctant to hack 
up with haywire patches. Now you can create your own source file-, add 
commands-, change subroutines! or delete unused features- Your newly 
assembled program will be as efficient and compact as the original! 

To use the Sourcerer! you'll need a dedicated MK block of memory- The 
code itself occupies just over 5K with another short section devoted to 
scratchpads input buffer and stack. The remainder of the MK block is 
used for symbol table storage- 
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THE SOURCERER 

The SOURCERER is a comprehensive d i ssassembler designed to 
convert 8080 object code files to source code files, which are 
easier to understand and modify. The code is systematically 
converted to INTEL standard mnemonics, and the created file is 
compatible with most 8080 assemblers, including ALS-8, Imsai, 
ESP-1 and others based on the Processor Technology assembler. To 
achieve this, the SOURCERER operates in two separate passes, one 
to form a symbol table and the second to convert the code to 
mnemonics. The symbols are placed at the beginning of the 
appropriate lines as the second pass progresses. For maximum 
versatility, the two passes are controlled separately. fror 
several of the formats, the symbols are simply hex addresses. 
In those formats intended for use in an assembler, the symbol is 
preceded by an H, (example - H237F) so that the assembler will 
interpret them as labels, not numbers. 

The SOURCERER forms the symbol table by examining all the codes 
in a block of memory (specified by the command), collecting the 
last two bytes of all three-byte codes, sorting these 16 bit 
addresses in numerical order, and storing them in a file for 
future use. It is important to note that all these codes ; are 
treated as labels, even though some may be constants. D^ing 
reassembly, the constants may cause errors, but these must .be 
located the hard way; there is no way to distinguish a constant 
from a label in an object code file. A a im il " P^. 1 ^® 00 "^ 
with data tables buried in the object code; the SOURCERER will 
treat these as commands and attempt to decode them. These must 
be located and output using the D command (see description of 
commands below) to prevent errors. Finally, the assenbl er will 
require some labels that the SOURCERER will not locate. These 
usually occur when a data table is used for command decoding. 
It will be necessary after forming the file to locate these 
areas and type in the required labels. 

Despite the limitations listed above, the SOURCERER is a very 
powerful tool. The author has used it to create source files 




familiarity with its operation 
COMMAND DESCRIPTION 

There are 12 commands, consisting cf one le t t er each Many of 
the commands require addresses; these are typed directly 
following the commands in hexadecimal notation Comm * nd * !^ 
exerted by the carriage return, and may be edited or aborted up 
to the time of execution. 
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Prompt string - when properly loaded, the program will respond 
with a triangular bracket and space (> )'. This signifies that 
the program is awaiting a command. 



Entry and editing - Commands are typed in as sing 
a non-existent command letter is typed, it will b 
the program. The rejected letter will be printed 
underline character (hex 5F), which will erase 
terminals. Proper commands will not be erased 
fields are to be typed following the command, th 
hex. Any non-hex character will' be rejected. Lea 
not required. If more than four hex characters ar 
the last four will be used. Any character may 
typing an underline (5F). Address fields are 
spaces, but the space between the command and the 
optional. Any command may be aborted by Cont rol 
will be discussed in the order in which they are 
be used. 
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S BBBB EEEE - Form symbol table. (BBBB and EEEE are 
beginning and end of the code block to be accessed). The symbol 
table is formed and stored on this pass through the code. If 
there are tables or other non-code entries in the block, these 
will be interpreted as code and erroneous entries will be made. 
(Examining the symbol table for unusual addresses is one method 
for locating tables.) 



T - Display symbol table. Ou 
hex addresses. 0000 is alw 
this value to locate the begi 

is listed sequentially, 8 add 

f~ *-?, 

V ~* 
X BBBB EEEEI- Display cross-r 

every symbbl in the symbol 

mnemonic code of every loca 

This is very useful when pr 

when long programs are involv 

the user should be aware that 

example, printing the cross-r 

Jbj ke up to 2 hours. (Note: 

[""with this command, the first 

j to or higher than this third 



tputs symbol table as a series of 
ays output; the d issassembler uses 
nning of the symbol table. Table 
resses per line. 

eference table. This command lists 
table followed by the address and 
tion that references the symbol, 
ograms are being modified. However, 
ed , this command is very slow, and 

long times may be involved. As an. 
eference table for a 5K program can 

if a third address field is used 
symbol to be listed will be equal 
field.) 



F AAAA - Set RAM file location. Initializes the counters for 
creating files in RAM. If AAAA = 0, no file will be created. If 
F is typed without any addresses, the starting and ending 
address of the file will be displayed. Only two commands will 
cause a RAM file to be built, D (Data Word) and L (List Format). 
Other commands will not affect the RAM area. 

LLLL RRRR - Set location offset. If the program to be 



disassembled is not located 
command will compensate, 
code is located; RRRR 

runs . 



at it's usual running 



address, this 

LLLL is the address where the object 

is the address where the code normally 
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D BBBB EEEE - Data word format. Outputs code block as Intel 
pseudo-op DW. Useful for forming tables for the assembler. This 
command and the L command will form files in RAM if the ,JK^ 
command has been used to set a starting address. 

N BBBB EEEE - Mnemonic format. This command lists the program 
with hex addresses, the hex version of the code, the ASCII code 
equivalent, the label, mnemonic, arguments, and data fields, if 
applicable. ASCII equivalents for two and three byte codes are 
output on the same line. As with all formats (except L), the 
listing will appear with all labels, mnemonics, arguments, etc., 
vertically aligned for easy reading. The N command is 
recommended for preliminary decoding of a new program. 



Print form at 



This command 



dumps code with 
H, mnemonics, 



P BBBB EEEE - 

decimal line numbers, labels with leading ,, , 

arguments and data fields. Two-byte data fields will have a 

leading H to allow referencing to the labels. One-byte data 

fields will have a leading zero and a trailing H, so that the 

assembler will treat them as hex constants. This format is one 

of two that will load directly into an assembler. This 

particular format is useful only with short programs, as it does 

add a lot of extra spaces. However, it is very easy to read. 



L BBBB EEEE - List format. This command is the same as P except 
all the excess spaces have been deleted. This command should be 
used when dumping code onto paper tape for reload ing into an 
assembler. This command is also the principle command for 
creating a file in memory. 

R DDDD II - Reset line numbers. This command establishes the 
first decimal line number (DDDD) and the line number interval 
that will be output with the P, L and D commands. Both DDDD 
and II should be decimal numbers. This command initializes to a 
starting value of 1 and an interval of 1. 

G AAAA - Goto. Transfers program control to the specified 
address AAAA. This is a utility command included to allow 
operating several programs in memory simultaneously without the 
need for transfering with the front panel switches. 



I - Initializ 
time it is ace 
used to re- in 
return , the te 
TYPE Y TO IN 
initializatio 
deleted , the 
RAM file) and 
initial value 
dec imal count 
prevent inadve 



e. The SOURCERER is sel f- initializing the first 
essed. After this first initialization, I must be 
itialize. When I is typed, followed by a carriage 
rminal will respond with a message which reads: 
ITIALIZE. At this point, if a Y is typed, full 
n will occur. Any formed symbol table will be 
and F commands will be reset to zero (Offset and 
the decimal number counter will be reset to it's 
If any letter other than Y is typed, only the 
er will be reset. (The message is output to 
rtent loss of the symbol table.) 
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LOADING AND PATCHING THE SOURCERER 



The SOURCERER requires a ded icated JULJLLafiis of memory, The code 
itself occupies just over 2K , wi th another short section devoted 
to scratchpad, input buffer and stack. The remainder of the 4K 
block is used fo symbol table /storage. The table is built from 
the top down , thus even the smallest table will use the zone at 
the end of the 4K block. The/e is room in the table for about 
700 symbols, which is adequate for nearly any program you might 
want to disassemble. (A 12K object code will have just about 
700 symbols) . , , n . _, 

4- 2 'i L- 
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I/O PATCHING 



The SOURCERER is del 
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first number of the ver 
version, for the 
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X0C0. Again, four extr 
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Immediately following the input routine is the pause routine. 
This routine allows dumps or listings to be interrupted by the 
spacebar or aborted by control-C. At location X0D2 the keyboard 
data port is read (input). This location is delivered with hex 
code 01, (the TTY data port); change it to match your keyboard 
port. This completes I/O patching. The input/output and pause 
routines are listed on the next page. 



SOURCERER EXAMPLE 



A short program, designed to run at 3000 hex, has been loaded 
into memory at location 1000. The SOURCERER is loaded at 3000, 
thus an offset is required. A file of the program will be 
formed at hex 4000. The program code is 32 hex bytes long. 
Upper case letters represent the output when running the 
SOURCERER; lower case comments were added later for clarity. 



Hex Dump of program 



1000 
1010 
1020 
1030 



21 
C3 
03 

F0 



00 
03 
30 
C9 



00 
30 
DB 



CD 
DB 
27 



22 
FF 
E6 



30 
E6 
08 



DB 
80 
C2 



FF 
C2 
22 



E6 
2C 
30 



HO 
30 
DB 



CA 
CD 
26 



13 
03 
C9 



30 
F0 
3E 



CD 
77 
5F 



03 
23 
CD 



F0 
C3 
03 
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> I 

TYPE Y TO INITIALIZE 



Initialize Command. Not re- 
quired when first starting. 



> S 1000 1031 



Form Symbol Table. 



Display Symbol Table. 



> T 

SYMBOL TABLE 

0000 3003 3013 3022 302C F003 



> X 1000 1031 

0000 

1000 LXI H 



3003 
1010 JMP 

3013 
100A JZ 

3022 

1003 CALL 

302C 
1017 JNZ 

F003 
100D CALL 



101F JMP 



1026 JNZ 



101 A CALL 



Display Cross-Ref erence Table 



102E CALL 



> F 4000 

FILE 4000 4000 



Set File Starting Address. 



> 1000 3000 



Set offset - Program Runs at 
Address 3000 Hex 



> N 1000 1031 



! 



1000: 


21 


1003: 


CD 


1006: 


DB 


1008: 


E6 


100A: 


CA 


100D: 


CD 


1010: 


C3 


1013: 


DB 


1015: 


E6 



3"0 3003 
§ 



3013 



LXI 


H.0000 


CALL 


3022 


IN 


FF 


ANI 


40 


JZ 


3013 


CALL 


F003 


JMP 


3003 


IN 


FF 


ANI 


80 



Use N Command for First Pass. 



Note Hex Addresses, 
Hex Codes, ASCII. 
Labels are Offset. 
No Leading O's or 
H's for this Format. 
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1017: 
1 01 A : 

101D: 
101E: 
101F: 
1022: 
1024: 
1026: 
1029: 
102B: 
102C: 
102E: 
1031 : 



C2 

CD 
77 
23 
C3 
DB 
E6 
C2 
DB 
C9 
3E 
CD 
C9 



,0 



w 

# 



"0 

& 



3022 



302C 



JNZ 


302C 


CALL 


F003 


MOV 


M,A 


INX 


H 


JMP 


3003 


IN 


27 


ANI 


08 


JNZ 


3022 


IN 


26 


RET 




MVI 


A,5F 


CALL 


F003 


RET 





> F 

FILE 4000 4000 



Examine File Length; File 
not Formed by "N" Command. 



R 10 



Set Decimal Line Numbers to 
Start at Zero, Count by 10. 



> L 1000 1031 

0000 LXI H.HOO00 
0010 H3003 CALL H3022 
0020 IN OFFH 

0030 ANI 040H 

0040 JZ H3013 

0050 CALL HF003 

0060 JMP H3003 

0070 H3013 IN OFFH 

0080 ANI 080H 

0090 JNZ H302C 

0100 CALL HF003 

01 10 MOV M,A 
0120 INX H 
0130 JMP H3003 
0140 H3022 IN 027H 
0150 ANI 008H 
0160 JNZ H3022 
0170 IN 026H 
0180 RET 

0190 H302C MVI A.05FH 

0200 CALL HF003 

0210 RET 



List Format. This Format has 
Decimal Line Numbers, Proper 
Spacing, Leading H, to form 
Label From Hex Address Symbols 
Constants Have Leading and 
Trailing H. This File Also 
Formed in RAM at Hex 4000. 



> F 

FILE 4000 4175 



Examine File Length 



> D 1000 1008 



Dump a Little in DW Format 
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0220 


DW 


00021H 


0230 


DW 


0CD00H 


0240 


DW 


03022H 


0250 


DW 


OFFDBH 


0260 


DW 


040E6H 



> R 1 1 



Reset Decimal Line Numbers 



> P 1000 1008 



Use the Print Format. 



0001 




LXI 


H,H0000 


0002 


H3003 


CALL 


H3022 


0003 




IN 


OFFH 


0004 
> 




ANI 


04 OH 
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INPUT/OUTPUT ROUTINES FOR THE SOURCERER 



00B0 


F5 




00B1 


DB 


00 


00B3 


Eb 


80 


00B5 


C2 


Bl 


00B8 


Fl 




00B9 


D3 


01 


00BB 


C9 




00BC 






0H3C 






00C0 






HCS 


DB 


00 


00C2 


E6 


01 


00C4 


C2 


C0 


J30C7 


DB 


01 


00C9 


E6 


7F 


00CB 


C9 




0*)CC 






00CC 






00Dp 






i^0D^ 


F5 




00D1 


DB 


01 


00D3 


E6 


7F 



M 



pp 



07 35 


CRT1 


PUSH 


PSW 


0740 




IN 


0yi 


0745 




AMI 


80H 


0750 




JNZ 


CRT1+1 


0755 




POP 


PSW 


0760 




OUT 


01 


07b5 




RET 




0770 


JC 






0775 




DS 


4 


0780 


*\ 






0785 


KEYB 


IN 


00 


0790 




AN I 


01 


0795 




JN£ 


KEYB 


0800 




IN 


01 


0805 




AN I 


7FH 


0810 




RET 




0815 


** 






08 2 




Db 


4 


0825 


:: 






083P 


PAUSE 


PUSH 


PSW 


0835 




IN 


01 


084^ 




AMI 


7FH 



OUTPUT ROUTINE 



4 BYTES RESERVED 
FOR OUTPUT ROUTINE 

INPUT ROUTINE 



4 3YTES RESERVED 
FOR INPUT ROUTINE 

PAUSE ROUTINE 
READ DATA PORT 



